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ABSTRACT 

We study the evolution of the number density, as a function of the size, of passive early-type galaxies 
with a wide range of stellar masses (lO^^M© < M^. < from z 3 to z ^ 1, exploiting the 

unique dataset available in the GOODS-South field, including the recently obtained WFC3 images 
as a part of the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS). In 
particular, we select a sample of ~107 massive (M* > passive (SSFR < 10~^Gyr~^) and 

morphologically spheroidal galaxies at 1.2 < z < 3, taking advantage of the panchromatic dataset 
available for GOODS, including VLT, CFHT, Spitzer, Chandra and HST ACS+WFC3 data. We find 
that at 1 < z < 3 the passively evolving early-type galaxies are the reddest and most massive objects 
in the Universe, and we prove that a correlation between mass, morphology, color and star-formation 
activity is already in place at that epoch. We measure a significant evolution in the mass-size relation 
of passive early-type galaxies (ETGs) from z 3 to z ~ 1, with galaxies growing on average by a 
factor of 2 in size in a 3 Gyr timescale only. We witness also an increase in the number density of 
passive ETGs of 50 times over the same time interval. We find that the first ETGs to form at z > 2 
are all compact or ultra-compact, while normal sized ETGs (meaning ETGs with sizes comparable 
to those of local counterparts of the same mass) are the most common ETGs only at z < 1. The 
increase of the average size of ETGs at < z < 1 is primarily driven by the appearance of new large 
ETGs rather than by the size increase of individual galaxies. 

Subject headings: cosmology: observations — galaxies: fundamental parameters — galaxies: evolution 



1. INTRODUCTION 

Early-type galaxies have been the objects of many 
studies in the recent years, as they contain crucial infor- 
mation about the evolution of the galaxy content. First, 
they are by definition the oldest objects at each epoch, 
and they can be considered relics of the star formation 
activity that happened in the past; the age of the stellar 
populations in these galaxies at z are compatible 
with redshifts of formations z > 2 (Renzini 2006). Sec- 
ond, early-type galaxies are the most massive objects in 
the local Universe, containing the bulk of the stellar mass 
. (Baldry et al. 2004). 
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Understanding the history of the assembly of these 
galaxies throughout cosmic time is crucial to constrain 
models of galaxy evolution. The first generation of ex- 
tremely massive ETGs {Mq > 10^^) is already in place 
at z ~ 2.5 (Guo et al. 2012); their number density 
dramatically increases during the 1 < z < 3 epoch 
(Ilbert-l-10, Ilbert-l-13, Cassata-|-ll, Brammer-f-11), fol- 
lowed by a milder evolution at < z < 1 (Bell et al. 2004; 
Faber et al. 2007; Pozzetti et al. 2010). The over- 
all evolution of the ETGs depends strongly on stellar 
mass, following a "downsizing" pattern: more massive 
ETGs build up preferentially earlier than the less mas- 
sive ones (Arnouts et al. 2007; Marchesini et al. 2009; 
Ilbert et al. 2010, 2013; Cassata et al. 2011, Bram- 
mer et al. 2011). 

At the same time, the massive passive ETGs are found 
to have 3-5 times smaller sizes at z > 1 than in the 
local Universe (Daddi et al. 2005; Trujillo et al. 2007; 
Toft et al. 2007; Zirm et al. 2007; Cimatti et al. 2008; 
Van Dokkum et al. 2008; Buitrago et al. 2008), and 
thus they are 30-100 times denser. These results ig- 
nited a debate about the possible mechanism producing 
the size evolution: Naab, Johansson & Ostriker (2009) 
claim that the observed size evolution of passive ETGs 
can be explained by minor merger events, while Khoch- 
far & Silk (2006) propose that the observed evolution can 
be explained by the variation of the amount of cold gas 
available during the major merger events that produce 
ETGs: the most massive ones formed at high-z, when 
major mergers were more gas rich than at later epochs. 
Although Lopez-Sanjuan et al. (2012) claimed that the 
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observed minor mergers at < z < 1 can account for up 
to 55% of the size growth of ETGs, Nipoti et al. (2012) 
claimed that minor and major mergers with spheroids 
are not sufficient to explain the observed size evolution 
between z ^ 2.2 and z ^ 1.3. 

From a theoretical point of view, some key questions on 
the formation and evolution of ETGs are still open. We 
still do not know through which mechanism the ETGs 
accrete their mass: is it through major mergers at early 
epochs, as it is predicted by models of galaxy forma- 
tion (Shankar et al. 2010; Shankar et al. 2011) or is it 
through the collapse of unstable disks, that have been 
shown to be numerous at z ^ 2 (Genzel et al. 2008; 
Forster-Schreiber et al. 2009)? What is the mechanism 
that shuts off the star formation in such objects, turning 
them into passively evolving ETGs? The dependence of 
the quenched ETG population on stellar mass and envi- 
ronment has been recently elucidated on phenomenolog- 
ical grounds (Peng et al. 2010, 2012), but what remain 
to be understood are the concrete physical mechanisms 
responsible for what is referred to as mass quenching and 
environment quenching in these studies. 

In this paper we take advantage of the wealth of data 
available in the Chandra Deep Field South field, mainly 
gathered as a part of the GOODS and CANDELS sur- 
veys, to select a robust sample of passive ETGs at 
1.2 < z < 3, that we use to constrain the assembly 
of their mass content as function of the size, comple- 
menting and completing the analysis performed in Cas- 
sata et al. (2011; Cll hereafter). In particular, we dig 
into the passive population down to — VS^^Mq up 
to z ~ 3, a mass regime between 5 and 10 times below 
M* (Ilbert et al. 2010). 

Throughout the paper, we use a concordance cosmo- 
logical model (I^a — 0.7, r2„i — 0.3 and iJo=70 km s^^ 
Mpc~^, we assume a Salpeter initial mass function (IMF; 
Salpeter et al. 1955) and we use AB magnitudes. 

2. DATA AND SAMPLE SELECTION 

The GOODS-South field is the one of the best stud- 
ied parts of the sky, having been imaged with all the 
largest available telescopes (Hubble, Spitzer, VLT, Chan- 
dra, XMM, Herschel, CFHT). The GOODS HST Trea- 
sury Program (Giavalisco et al. 2004) provides ultra- 
deep high-resolution images in B-, V-, i- and z-bands. 
Deep ground-based imaging in the U-band is provided 
by VIMOS/VLT for the CDFS (Nonino et al. 2009). 
Moreover, VLT/ISAAC imaged the CDFS in J-, H- 
and K-bands. Ultradeep Spitzer/IRAC imaging is also 
available in the 3.6, 4.5, 5.6 and 8.0 /xm MIR chan- 
nels and in the 24fim FIR one. The field has also 
been recently imaged with Herschel/PACS at 100 and 
160 fim as a part of the GOODS/Herschel program 
(Elbaz et al. 2011). About 3000 spectroscopic red- 
shifts are also available, among which 1200 are at 
z > 1 (Cimatti et al. 2008; Vanzella et al. 2008; 
Popesso et al. 2009; Kurk et al. 2013). 

The Cosmic Assembly Near-infrared Deep Extragalac- 
tic Legacy Survey (CANDELS; Grogin et al. 2011; 
Koekemoer et al. 2011), the largest HST campaign ever 
undertaken, is collecting high resolution images in the 
F105W (Y-band), F125W (J-band) and F160W (H- 
band) filters for 5 of the most intensively studied extra- 
galactic fields: namely, GOODS-South and North, ECS, 
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Fig. 1.— Stellar mass vs. AB magnitude in the WFC3/F160W 
H-band filter in four redshift bins: 1.2 < z < 1.5, 1.5 < z < 
2, 2 < 2 < 2.5 and 2.5 < 2 < 3. The vertical dashed line at 
™-Fi60W = 25.5 indicates the magnitude at which the CANDELS 
H-band images are 80% complete, the dashed horizontal line at 
M/Mq = 10^° shows the mass limit of our sample, the red and 
blue diagonal lines indicate, for each redshift bin, the stellar mass 
vs H-band magnitude relation for the reddest and bluest models 
available in our template grid. 



UDS and COSMOS. In this paper we include the first 
4 epochs of CANDELS observations in GOODS-S, that 
cover the central ^ 80 arcniin^ of the GOODS-S field, 
for a total integration time of 2 orbits each in F125W 
(J-band) and F160W (H-band). We include as well the 
WFC3 observations taken as a part of the Early Release 
Science Program 2 (ERS2: GO 11359. PI: O'Connell; 
Windhorst et al. 2011), that cover an additional ^ 40 
arcmin^ area in the north part of the GOODS-S field, 
with integration times of 2 orbits in each of the F098M 
(Y-band), F125W (J-band) and F160W (H-band) fil- 
ters. The final mosaics in Y-, J- and H-band were as- 
sembled using MultiDrizzle (Koekemoer et al. 2002), 
combining the data to a 0.06" pixel grid, and produc- 
ing a PSF of ^0.16" in our resulting WFC3 images. The 
la fluctuations of the sky for the ERS2 regions are 27.2, 
26.6, and 26.3 AB arcsec^ in the Y, J and H bands, 
respectively; for the 4-epoch CANDELS region, the la 
fluctuations are 26.6 AB arcsec^ for both the J- and 
H-bands. 

We have built a multi-wavelength catalog using the 
TFIT procedures by Laidler et al. (2007), using the 
H-band as the detection image and including the U- 
band from VIMOS, the B, V, i and z bands from 
GOODS/ACS, the Y, J and H bands from CAN- 
DELS/ WFC3 and the Ks band from VLT/ISAAC 
(Guo et al. 2013, in prep). This procedure allows us 
to match the point-spread function of images having 
different resolutions and to obtain homogeneous aper- 
ture magnitudes from the different images. The catalog 
includes about 11,000 objects brighter than m_f/=25.5, 
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4500 of which are at 1.2 < z < 3, according to their 
spectroscopic or photometric redshift (see below). 

We included the spectroscopic redshifts available in lit- 
erature for the GOODS-S field (Vanzella et al. 2008; 
Cimatti et al. 2008; Popesso et al. 2009; Kurk et al. 2013) 
for 2232 objects, and we measured photometric redshifts 
for the remaining objects, using the PEGASE 2.0 tem- 
plates (Fioc & Rocca-Volmerange 1997), following the 
same procedure in Guo et al. (2012). 

We then fitted the SED from UV to 8 ^m to the up- 
dated version of Bruzual&Charlot (2003, CB07) in or- 
der to get accurate measurements of stellar mass, E(B- 
V), age and SFR of the galaxies. In particular, we use 
a Salpeter IMF (Salpeter 1955) with lower and upper 
masses of 0.1 and 100 Mq, we apply the Galzetti law 
(Calzetti et al. 2000) to describe the dust extinction and 
we assume exponentially declining star formation histo- 
ries SFR(t)cx e(-*/^), where t is age (i. e. the time 
passed since the peak of the star formation) and r is the 
characteristic time of the star formation event. Maraston 
et al. (2010) showed that these exponentially declining 
SFR models tend to overestimate the SFRs and under- 
estimate stellar masses for star-forming galaxies at Z'-^2, 
while truncated exponentially increasing models give bet- 
ter results; however, the efi'ect of such different SFH on 
passively evolving galaxies have not been conducted yet, 
and we defer this analysis to a forthcoming paper. In any 
event, these SED fits are only used to identify passively 
evolving galaxies. 

We extracted a "parent" catalog of 1051 galaxies with 
> 1O^°M0 and redshift z > 1.2, from which we se- 
lected only passive galaxies with Specific Star Formation 
Rate SSFR < IQ-^ Gyr^^ (332 galaxies). This SSFR 
limit is very restrictive: galaxies with SSFR = 10^^ 
Gyr~^ would need 100 Gyr to double their stellar mass, 
if they continue to form stars at the present rate. We 
then eliminated all galaxies with a late-type morphol- 
ogy in the H-band WFC3 image (based on visual in- 
spection, similarly to Cll) and we also excluded galaxies 
detected in the Spitzer/MIPS 24/xm and Herschel/PACS 
100/xm channels, ensuring that the SFR is lower than 
10 MQyr~^ (Elbaz et al. 2011): the resulting sample of 
passive ETGs contains 107 galaxies. For the remaining 
of the paper, we define "ETGs" galaxies that are pas- 
sive according to our criteria (SSFR and no detection 
in Herschel/Spitzer) and have a spheroidal morphology. 
84% and 92% of the resulting 107 passive morphologically 
spheroidal ETGs have sersic indices n > 2 and n > 2.5, 
respectively (similarly to Bell et al. 2012): this ensures 
that our sample is not contaminated by quiescent disks 
as in van der Wei et al. (2011). 

We used the GALFIT package (Peng et al. 2002) to fit 
the light profile in the H band (matching the rest- 
frame optical up to z 3): 



I{r) = lecxp ■ 



l/n 



(1) 



where I(r) is the surface brightness measured at dis- 
tance r, /e is the surface brightness at the effective ra- 
dius Te and bn is a parameter related to the Sersic in- 
dex n. For n=l and n=4 the Sersic profile reduces re- 
spectively to an exponential and deVaucouleurs profile. 



Bulge dominated objects typically have high n values 
(e.g. n > 2) and disk dominated objects have n around 
unity. Ravindranath et al. (2006), Cimatti et al. (2008), 
Trujillo et al. (2007) and van der Wei et al. (2012) showed 
that GALFIT yields unbiased estimates of the Sersic in- 
dex and effective radius for galaxies with S/N> 10 and 
re > 0.03", independently of the redshift of the source, 
thus demonstrating that the surface brightness dimming 
is not an issue for this kind of study. 

The PSF was obtained in each passband needed by av- 
eraging well-exposed, unsaturated stars. We run GAL- 
FIT experimenting on various sizes of the fitting region 
around each galaxy, and with the sky either set to a pre- 
measured value or left as a free parameter. We verified 
that the sizes and Sersic indices do not vary by more 
than 10% in the various cases. The values that we show 
throughout the paper were obtained with a free sky and 
6x6 arcsec^ fitting regions. Any close-by object detected 
by SExtractor within each fitting region was automati- 
cally masked out during the fitting procedure. 

Figure [T] illustrates the completeness of our H-band 
selected catalog at different redshifts. In particular, we 
want to assess our ability to recover ETGs, which are 
very red because of the extremely constraining SSFR cut 
(SSFR< 10~^), as a function of the stellar mass and red- 
shift. The H-band image that we use in this work is 
80% complete at least down to magnitude 25.5 for typi- 
cal z - 2 spheroids with = 0.125" (Guo et al. 2013, 
in prep). We note that the 107 ETGs in our sample are 
detected at at more than lOcr in at least 4 filters blue- 
ward than the Balmer break, ensuring good photometric 
redshifts and mass determination. 

In Fig. [1] we plot the stellar mass versus H-band magni- 
tude for the reddest models in our grid (thus the hardest 
to detect), and we compare it with the distribution of 
our real ETGs at 1.2 < z < 3. As expected, all the real 
galaxies lie right of the reddest model, meaning that, for 
a given stellar mass, they are brighter than the model. 
Since the H-band imaging is 80% complete down to mag- 
nitude rriH ^ 25.5 (Guo et al. 2013, in prep), we can con- 
clude that our sample is complete down to stellar mass 
Mq — 10^° up to z ~ 3. This implies that the scarcity of 
objects with lO^^M© < M* < 10^°-^ in the two highest 
redshift bins of Fig. |T] is a real effect, and it is not due to 
mass incompleteness. 

3. THE COLOR-MASS DIAGRAM AT 1.2 < Z < 3 

In Figure [5] we show the Nuv color-mass diagram 
in four redshift bins (1.2 < z < 1.5, 1.5 < z < 2, 
2 < z < 2.5 and 2.5 < z < 3) for the "parent" cat- 
alog (empty circles) and the ETGs (filled red circles). 
The absolute magnitudes for the Njjv and V filters are 
computed by interpolating the observed photometry at 
the location expected for the GALEX NUV 2500A and 
Johnson V 5500A, respectively. We separate the parent 
catalog in morphologically early- and late-types, based 
on the visual H-band morphology. 

We observe a color bimodality up to z ~ 2, with the 
red peak dominated by ETGs and the blue distribution 
dominated by star forming late-type galaxies. At z > 2 
the color distribution is unimodal, but still the red galax- 
ies are on average more massive than the blue ones. In 
the two highest redshift bins, the bulk of objects displays 
blue colors and only a "tail" of red objects is present. 



4 



Cassata et al. 




Fig. 2. — The N^/y-V color vs mass in the optical rest-frame for 
in four redshift bins: 1.2 < 2 < 1.5, 1.5 < 2 < 2, 2 < 2 < 2.5 and 
2.5 < z < 3. The filled circles, red empty circles and blue empty 
circles represent the passive early— type galaxies (spheroidal mor- 
phology and SSFR< 10~^), the morphologically early— type galax- 
ies and the morphologically late— type galaxies, respectively. The 
black continuous and dashed lines show a fit to the ETGs in the 
first two bins combined. The insets show the color distribution in 
each bin, with the same color coding as the large figures. The red 
continuous and dotted lines show the median mass and scatter for 
the ETGs in the 4 redshift bins. 

Only about 50% of the galaxies on the red sequence at 
such redshift are passive according to our criteria. 

We note that the ETGs are the reddest and most mas- 
sive objects, and that the red sequence is mostly popu- 
lated by galaxies with M* > lO^^-^M©, at all redshifts. 
The average mass for ETGs in the four redshift bins is 
around = IO^^'^Mq, with a dispersion of 0.2 dex. 
This shows that the most massive ETGs are the first to 
form at z > 1.2, and that, as cosmic time goes by, new 
low mass ETGs are formed. This result confirms earlier 
findings according to which the most massive galaxies are 
the first to be quenched (e.g., Bundy et al. 2006; Frances- 
chini et al. 2006; Cimatti, Daddi & Renzini 2006) and 
is in qualitative agreement with the phenomenological 
model of Peng et al. (2010). 

We see that there is a clear correlation between the 
morphology, the color and the star formation activity of 
the galaxies, at least up to z = 2: morphological early- 
type galaxies are typically red and passive; morphologi- 
cal late-types are blue and star forming. This is in good 
agreement with Whitaker et al. (2011), who found a red- 
sequence up to z ~ 3.5 (but the authors did not analyze 
the morphology of the galaxies on the red sequence) , and 
BeU et al. (2012). These findings imply that the Hub- 
ble sequence, in the sense of a correlation between stellar 
mass, star formation properties and morphology, as ob- 
served in the local Universe, is already in place at z ~ 3. 
The mere existence of ETGs at z 3 indicates that such 
objects formed and were quenched at even higher red- 
shift; our estimate of the age is not accurate enough to 



Fig. 3. — The mass size relation in the optical rest-frame for in 
four redshift bins: 1.2 < z < 1.5, 1.5 < 2 < 2, 2 < 2 < 2.5 and 
2.5 < 2 < 3. In all bins, the grey filled region indicates the locus 
occupied by SDSS passive galaxies at < 2 < 0.1: the continuous 
line shows the median of the distribution, and the dashed lines 
contain 68% of the objects (from Cll). Galaxies below the dotted 
line are defined as ultra-compact according to Cll. 

accurately age-date the bulk of the stellar component of 
such objects, but presumably they accreted the bulk of 
their mass about 1 Gyr prior to the observation, implying 
a formation redshift z ^ 5 (similar to Gobat et al. 2012). 

4. THE MASS-SIZE RELATION FOR EARLY-TYPE 
GALAXIES AT 1< Z < 3 

In Figure [3] we show the mass-size relation for the 
107 massive and ETGs at 1.2 < z < 3, in four red- 
shift bins. The size is measured in the WFC3 H-band 
and matches the optical rest-frame in the whole red- 
shift range. The plot shows also the mass-size relation 
for local ETGs drawn from the SDSS as derived by Cll, 
according to whom galaxies below the SDSS local rela- 
tion (the grey strip in Figure [S]) are defined as compact 
ETGs (by definition, 17% of local ETGs are compact), 
and those more than 0.4 dex smaller than local coun- 
terparts of the same mass (the dotted line in Figure [3]) 
are defined as ultra-compact. According to these defini- 
tions, compact and ultra-compact galaxies with a stel- 
lar mass = have respectively sizes smaller 
than about 1 and 0.5 kpc. In our sample, at 2.5 < z < 3, 
2 < z < 2.5, 1.5 < z < 2 and 1.2 < z < 1.5, respec- 
tively -90% (-70%), -90% (-60%), -80% (-50%) and 
-70% (-40%) of the ETGs are compact (ultra-compact) 
ETGs. 

In Figure [4] we report the size as a function of the red- 
shift for our sample of 107 massive ETGs at 1.2 < z < 3, 
together with data at < z < 1.2 from Cll: measure- 
ments for ETGs at 0.3 < z < 1.2 are done in the z-band, 
that matches the optical rest-frame (see Cll for details); 
ETGs in the local universe (0 < z < 0.1) are drawn from 
the SDSS, and the measurements of their size are taken 
by the DR7 NYU Value-Added Galaxy Catalog (Elan- 
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Fig. 4. — Evolution of the size for passive galaxies at < z < 3, 
normalized to the SDSS. The blue and red points represent ETGs 
with Mt < IO^'^'^Mq and with A/, > 10^'^-''Mq, respectively. 
Measures at z < 1.2 (indicated by the dotted vertical line) come 
from Cll. The green filled circles with their error bars represent 
the median size and scatter in bin of redshift. The black, blue and 
red dashed line show the best fit for all the ETGs, the ETGs with 
M, < lO^^'^AfQ and ETGs with M, > IO^^'^Mq, respectively. 



ton et al. 2005, see Cll for details). Galaxy sizes are 
normalized to the average size of the SDSS ETGs with 
the same stellar mass. We also tried to normalize the 
size to the best-fit Re cx Mq^^ relation for each redshift 
(as for example in Cimatti, Nipoti & Cassata 2012), but 
the essence of the results discussed here did not change. 
In the same figure, we report the median normalized size 
for 8 redshifts, along with the scatter of the distribu- 
tion: the 1 — a standard deviation of the distribution 
is about 0.25 dex at all redshifts. We parameterize the 
evolution of the average size as (rg) cx (1 + z)", and we 
fitted the global population of ETGs, as well as ETGs 
with M § M^^-^Mq separately. To avoid the fit be- 
ing completely driven by the ~100,000 SDSS galaxies at 
< z < 0.1, we exclude them from the fit. We find 
1.29 ± 0.10 for the global population of ETGs, 



a = -0.89 ± 0.16 for ETGs with < M^^-'^Mq and 
a = -1.50 ± 0.12 for ETGs with > M^°-^Mq. This 
means that the size evolution is faster for high mass 
ETGs than for the low mass ones, in qualitative agree- 
ment with Ryan et al. (2012). Interestingly, this result 
does not change if only the Cll galaxies with z < 1.2 
are included in the fit: we find a = —1.18 ± 0.15, 
a = -0.90 ± 0.22 and a = -1.33 ± 0.18 for all the 
ETGs, the ETGs with < M^°-^Mq and the ETGs 
with > M^°-^Mq, respectively. 

These values are in good agreement with results by 
Cimatti, Nipoti & Cassata (2012), who found -1.25 < 
a < —0.8, and slightly smaller (in absolute value) than 
the value published by Damjanov et al. (2011), who 
found a = —1.62. However, we stress that the sam- 
ple used here (the 107 ETGs at z > 1.2 plus the 
ETGs at z < 1.2 by Cll is complete in mass down to 
Af, = 1O^°M0 up to z = 3, and it selected according to 
the same criteria at all redshift, thus being more homo- 
geneous and complete than the ones used in these works, 
which are compilations of different samples published in 
literature. 
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Fig. 5. — Top panel: number density as a function of the age of 
the universe and redshift of all ETGs (black circles), normal size 
ETGs (blue diamonds), compact ETGs (green triangles) and ultra- 
compact ETGs (red squares). As a comparison, we report the data 
for passively evolving galaxies from Ilbert et al. (2010, cyan dotted 
line), Ilbert et al. (2013, purple dotted line), Brammer et al. (2011, 
black dotted line) and Moustakas et al. (2013, yellow filled star). 
We also report the lower limit estimate of the number density of 
compact ETGs by Valentinuzzi et al. (2010a, black upward arrow), 
fully compatible with our z ~ estimate based on SDSS. Bottom 
panel: Fraction of passive galaxies that are also compact (green 
triangles) and ultra-compact (red squares), as a function of the 
age of the universe or the redshift. 



5. EVOLUTION OF THE NUMBER DENSITY IN PASSIVE 
GALAXIES AT 1< Z < 3 

In Section 4 we saw that the mass-size relation of 
massive ETGs evolves fast with the redshift between 
z ~ 3 and z ^ 1.2: at z > 2 about 90% of the 
ETGs are compact (i.e. 1 — cr below the local relation), 
with that fraction dropping to 70% at z ^ 1.2. From 
Fig 4 we also learned that the average size of ETGs 
roughly doubles over the same redshift interval. The 
evolution of the average size of galaxies illustrated in in 
Figure 4 can be driven by two simultaneous processes: 
the size increase of each galaxy, possibly due to (minor) 
dry mergers (e.g., Naab et al. 2007; Nipoti et al. 2012; 
Oser et al. 2012) and to the appearance of new ETGs 
with a size distribution progressively shifted to larger 
sizes (Valentinuzzi et al. 2010a, b; Cassata et al. 2011; 
Newman et al. 2012; Poggianti et al. 2012; Car- 
ollo et al. 2013). We know that the number den- 
sity of ETGs dramatically increases with cosmic time 
(i.e. Ilbert et al. 2010; Pozzetti et al. 2010), hence the 
size evolution can not be entirely attributed to the for- 
mer mechanism, as it has been recently attempted by 
Oser et al. (2012). In order to try to constrain the rela- 
tive importance of the two mechanisms, in this section we 
study the number density of ETGs of different physical 
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sizes as a function of the redshift and the cosmic time. 

In Figure [5] we report the evolution of the number den- 
sity of ETGs between z = 3 and z = 1.2 from this work, 
together with the same values at < z < 1.2 taken 
from Cll. The only difference with Cll is that here we 
rebinned the data at < 2; < 1.2 in a slightly differ- 
ent manner, to better sample the range 0.1 < z < 0.7, 
that covers about half of the age of the Universe and 
was sampled with only one bin in Cll. We stress that 
the uniqueness of this work is that we exploit for the first 
time a sample of ETGs complete down to M^, = 10^'-' Mq, 
extending the analysis of Cll up to z ~ 3, when the 
Universe was just 2 Gyr old. We find that the number 
density of ETGs increases by a factor of 50 in a time 
span of only 3.5 Gyrs, between z ^ 3 and z ^ 1. Thus, 
the epoch 1 < z < 3 is critical for the build-up of the 
number of massive ETGs, in good agreement with e.g., 
Ilbert et al. (2010, 2013) and with the phenomenolog- 
ical model of Peng et al. (2010). The number density 
of ETGs then further doubles in the 10 Gyr time span 
between z ^ 1 and z ~ 0, which is primarily due to 
the increase of low-mass ETGs (e.g., Ilbert et al. 2010; 
Franceschini et al. 2006; Cimatti, Daddi & Renzini 2006; 
Peng et al. 2010). Our measurements for the number 
density evolution of the global population of ETGs is 
in good agreement with Ilbert et al. (2010, 2013) and 
Brammer et al. (2011), obtained with much larger and 
less cosmic variance prone samples. 

We see that the first ETGs to appear at z ~ 3 are 
virtually all compact or ultra-compact. The very rapid 
increase in the number density of the global population of 
ETG between z ~ 3 and z ~ 1 is caused by the fast and 
steady increase of the number densities of all normal, 
compact, and ultra-compact ETGs. By z ^ 1.5, the 
normal ETGs become more numerous than the ultra- 
compact, and at z ~ 1 they are the most common ETGs, 
exceeding the number of compact ETGs. 

After z 1, the number density of normal ETGs keeps 
steadily increasing, albeit at a modest rate. At the same 
time, the evolution of compact and ultra-compact ETGs 
decouples from that of the large ETGs, with their num- 
ber density steadily decreasing. In the local Universe, the 
compact galaxies are (by definition, as they are defined to 
be 1-(T below the local relation) 17% of the global popula- 
tion of ETGs, and the ultra-compact formally disappear 
almost completely by z ^ 0. However, it has been ar- 
gued that the SDSS database may be biased against very 
compact galaxies (Scranton et al. 2002; Shih & Stock- 
ton 2011; Valentinuzzi et al. 2010a; CaroUo et al. 2013). 
We note that our estimate of the number density of com- 
pact ETGs at z ~ is '-^5 times higher than the lower 
limit determined by Valentinuzzi et al. (2010a), thus for- 
mally in agreement, and that our new point at z ~ 0.4 
seems to confirm the mild decrease. From z ~ 1 to z ~ 
the number density of compact ETGs decreases by a fac- 
tor of around 2. The picture is less clear for the ultra- 
compact ETGs: the SDSS imaging does not offer enough 
spatial resolution to identify ultra-compact ETGs, and 
thus we indicate the number density of ultra-compact 
ETGs as a lower limit in Figure 5. The new point at 
z ~ 0.4 seems to support the fast decrease of the the 
number density of ultra-compact ETGs at < z < 1, but 
the large error bar does not help to unambiguously fix 
the issue. In the end, the evolution seems to be strongly 



size-dependent, with a faster decrease for smaller ETGs, 
even though we can not draw any strong conclusion due 
to the z ~ uncertainties. This differential evolution 
is in quite good agreement with recent results by Car- 
oUo et al. (2013), who found a even milder decrease of 
the number density of ETGs with sizes smaller than 2 kpc 
(that is slightly larger than our "compactness" definition: 
according to our definition, ETGs of M* = lO^^-^M© are 
compact if their size is smaller than 1 kpc). 

6. DISCUSSION AND CONCLUSIONS 

In this paper we analyzed a sample of passive (SSFR< 
lO-^Gyr-i) and massive (M* > IO^^Mq) ETGs at 
1.2 < z < 3. The deep H-band imaging ensures that 
the sample is complete down to = lO^^M©, about 
an order of magnitude below M* (Ilbert et al. 2010) 
and allows us to study the morphology of our galaxies 
in the optical rest-frame up to z ~ 3. Moreover, the 
dense multiwavelength coverage ensures robust redshift 
measurements (for the objects without a spectroscopic 
redshift), accurate mass and star formation activity de- 
terminations. The Herschel and Spitzer ancillary data 
allowed us to reinforce the passivity selection based on 
the SED fitting procedure. 

The key finding of this paper are: 1. the identification 
up to z 3 of passive and massive ETGs, that dominate 
the red sequence (well defined up to z 2, less clear at 
2 < z < 3, see Figure [2]); 2. the accurate determination, 
with the evolution of average size for ETGs at < z < 3, 
reported in Figure |4l 3. the robust determination of the 
differential evolution of the spatial abundance of massive, 
i.e. stellar mass M, > 10^° Mq, passive galaxies as a 
function of redshift and as a function of the their size, a 
proxy of their average projected central stellar density, 
shown in Figured 

At high redshift, i.e. z ^ 3, the size distribu- 
tion of ETGs is heavily tilted toward small sizes, and 
massive and passive galaxies are predominantly high- 
stellar density systems. This implies that the mecha- 
nism through which these galaxies accrete their mass and 
eventually become passive at those early epochs leaves 
a remnant that is very compact (Cimatti et al. 2008). 
Many different mechanisms leading to the formation 
of compact remnants have been proposed in litera- 
ture: Dekel et al. (2009) argued that disk instabili- 
ties lead to the formation of compact remnants; Hop- 
kins et al. (2008), Wuyts et al. (2010) and Bour- 
naud et al. (2011) showed that gas-rich mergers can form 
compact galaxies; Johansson, Naab, & Ostriker (2012) 
proposed that compact galaxies can formed by in situ 
star formation driven by cold fiows. 

By z ^ 1 the abundance of small ETGs reaches its 
peak, followed by a decline, apparently faster for the 
ultra-compact than for the compact ones (see Section 5 
and discussion of the uncertainties for z < 0.5). Appar- 
ently, whatever set of mechanisms was producing ultra- 
compact passive galaxies at 1 < z < 2.5 is not working 
anymore at later epochs. Moreover, the size distribution 
of the new ETGs that become passive between z ~ 1 
and z ~ is gradually moving towards larger sizes, im- 
plying that the mechanism through which star formation 
is quenched in these massive ETGs acts in such a way as 
to leave larger quenched remnants as (cosmic) time goes 
by, a tendency which accelerates at z < 1. This may be a 
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natural consequence of the inside-out growth of the disks 
(a widely entertained notion, Samland & Gerhard 2003; 
Brook et al. 2006; Muhoz-Mateos et al. 2011), with such 
disks being the likely precursors to passive ETGs. So, 
bigger disks would leave bigger ETG remnants. 

In the < z < 1 epoch the average size of the ETGs 
steadily increases (see Figure 4) . This evolution might be 
due to two different mechanisms (or a mix of the two): 
on one hand, the individual ETGs increase in size, by 
minor merging or accretion; on the other hand new al- 
ready large ETGs, that never passed through the com- 
pact or ultra-compact phase, appear. In this respect, 
the evolution of the number densities of ETGs of dif- 
ferent sizes, constrained in Figure 5, can help to weigh 
the relative importance of two mechanisms. In partic- 
ular, even assuming that all the disappearing compact 
and ultra-compact ETGs are transformed in ETGs of 
normal size and density, in a cascade of merging and in- 
teractions to form normal-sized galaxies (e.g. Huang et 
al. 2012), Figure 5 shows that they are not numerous 
enough to support the increase of the total number of 
normal ETGs observed over the same time interval. In 
fact, between z ~ 1 and z ~ the number density all 
ETGs increases by about ^ 0.8 x 10~'^Afpc~'^, while the 
number density of compact ETGs that disappear is only 
~ 0.3 X 10~^Mpc~^. This implies that the observed rate 
at which compact galaxies disappear can not sustain the 
observed increase of the number of normal ETGs; thus 
the growth of the individual ETGs is not the dominant 
mechanism causing the average size of ETGs to increase 
(Fig. 131), in agreement with Carollo et al. (2013). 

At the same time, the decrease of the number density 
of compact and ultra-compact ETGs between z ~ 1 and 
z ~ indicates that a fraction of these galaxies either be- 
come active, because of a sudden refurbishment of fresh 
gas, or -more likely- they grow in size, via minor merging 
or smooth accretion, becoming normal-sized ETGs. 

Before concluding, we also wish to comment on the 
cosmic rise and fall of the ultra-compact passive galax- 
ies that this paper has documented. The extreme com- 
pactness and large stellar mass of these galaxies argue 
against hierarchical merging of stellar systems as the pri- 
mary mechanism for the assembly of their stellar mass 
(Giavalisco et al. in prep.). Rather, it is likely that they 
emerged in their final state after the quenching of star- 
forming galaxies of the same size and density. In this case 
the rise and fall of ultra-compact ETGs are the result 
of four different processes: 1) the rate at which ultra- 
compact star-forming galaxies form in this mass range 
(roughly M* > 10^° Mq) ; 2) the rate at which they are 
destroyed before they quench, i.e. they are transformed 
from ultra-compact into galaxies of lower density by 
merging and interactions; 3) the rate at which the ultra- 
compact star-forming galaxies shut down star formation 
and become passive; 4) the rate at which the ultra- 
compact passive galaxies are transformed into galaxies 
of lower stellar density by merging and interactions. 1) 
and 2), although not directly constrained by this work, 
define the "reservoir" of compact star-forming galaxies 
from which compact ETGs can form; the effects of mech- 
anism 3) and 4) in shaping the number density evolution 
of compact ETGs has been extensively discussed in this 
paper. 

Interestingly, Wuyts et al. (2011) and 



Barro et al. (2013) identified a rich population of 
compact star forming galaxies at 1.5 < z < 3, whose 
number densities and physical properties are compat- 
ible with being the progenitors of the compact and 
ultra-compact ETGs. The authors speculate that 
those compact star forming galaxies are formed from a 
gas-rich process (merger or disk instabilities) that at 
first induce a compact starburst and then feed an AGN, 
which quenches the star formation turning these objects 
into compact ETGs. They find that these compact star 
forming objects are very rare at z < 1.5, supporting our 
conjecture that after that epoch the mechanism through 
which ETGs are formed produce a large remnant. 
However, the definition of ETGs in Barro et al. (2013), 
as well as the definition of which star forming galaxies 
can transform in ETGs differ from ours. So, we looked 
for progenitors of our ETGs in a companion paper, 
Williams et al. (2013, in preparation): we have identified 
a sample of compact star forming galaxies which may 
be progenitors of our high-redshift passive sample, 
(which represent some of the first ETGs to appear in the 
universe). This sample have consistent SFRs and stellar 
masses with our ETGs, and are evolutionarily consistent 
assuming physically motivated SFHs. This sample 
demonstrates that massive and active enough compact 
SF galaxies exist at z > 3 to account for the number 
density of compact ETGs presented here, assuming 
mass buildup by purely in-situ star-formation, further 
justifying the evolutionary link between compact SF 
and compact passive galaxies. 
To summarize, our main findings are: 

• We find that at 1 < z < 3 the passively evolv- 
ing ETGs are the reddest and most massive ob- 
jects in the Universe. This implies that an embryo 
of the Hubble Sequence, in the sense of a corre- 
lation between morphology, mass, color and star- 
formation activity of galaxies, is already in place 
at z 3. We observe a scarcity of ETGs with 

< lO^O-^M©, with the majority of our ETGs 
having > Since we accurately set 

our mass completeness to M^, = lO^^M©, we can 
conclude that that scarcity is not due to an observa- 
tional bias. Hence, at z > 1.2 the mechanism pro- 
ducing ETGs leaves a remnant that is preferentially 
very massive. This result reinforces previous claims 
for a "downsizing" pattern of the mass assembly 
of ETGs. A possible interpretation is that at that 
early epoch the process that suddenly quenches the 
star-formation activity in some objects, transform- 
ing them into passively evolving ETGs, is effective 
preferentially for objects with stellar mass above 
Af* = IO^^'^Mq. This result is in qualitative agree- 
ment with Peng et al. (2010), who highlighted the 
prominent role of mass quenching at high redshift. 

• We measure a significant evolution of the mass- 
size relation of ETGs from z ~ 3 to z ~ 1, with 
the average size of galaxies increasing by roughly 
a factor of ~2 over this redshift interval, corre- 
sponding to 3 Gyrs of cosmic time. The evo- 
lution of the size of ETGs is faster for galaxies 
with M, > IO^O-^Mq than for those with M, < 
10^°-^ Mq. About 90% (70%) of the ETGs at z > 2 
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are compact (ultra compact). Wc find that the 
average size of ETGs between z ^ 3 and z ^ 
evolves with the redshift following a simple power 
law re ~ (1 + z)", with a = -1.18 ± 0.15. If ETGs 
with M ^ M^"-^Mq arc fitted separately, we find 
a marginally steeper power a for the most massive 
ETGs (a = —1.33 ± 0.18), indicating a faster size 
evolution. 

• We witness the build up of the most massive ETGs, 
with their number density increasing by 50 times 
between ^; ^ 3 and 2 ~ 1. We find that 90% of 
ETGs at 2; > 2 are compact or ultra-compact, in- 
dicating that the event through which such first 
ETGs accrete their mass leaves a remnant that is 
very compact. As the cosmic time goes by, new 
ETGs tend to be larger and larger, with the "nor- 
mal" sized ETGs (meaning those objects with sizes 
comparable to the local ETGs of the same mass) 
becoming the most common ETGs at 2; ~ 1. At 
z > 1 the mechanisms creating new ultra-compact 
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